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Abstract —In this paper we consider the identification problem 
of Cellular Automata (CAs). The problem is defined and solved 
in the context of partial observations with time gaps of unknown 
length, i.e. pre-recorded, partial configurations of the system at 
certain, unknown time steps. A solution method based on a 
modified variant of a Genetic Algorithm (GA) is proposed and 
illustrated with brief experimental results. 


I. Introduction 


CAs present an attractive and effective modelling technique 
for a variety of problems. In order to use CAs in a practical 
modelling task, one needs to understand the underlying rules, 
relevant to the given phenomenon, and translate them into a 
CA local rule. Additionally, the state space, tessellation and 
neighborhood structure need to be pinned down beforehand. 
This narrows the application area for CAs, since there are 
problems for which it is hard to manually design a proper 
local rule. In some cases only the initial and final states of 
the system are known ( e.g. |Q—13|). Such problems motivate 
the research on automated CA identification. Various methods 
have been used, including genetic algorithms (GAs) 143-171, 
genetic programming |8]-JT0|, gene expression programming 
CD- ant intelligence p2| , machine learning [T3|, as well as 
direct search/construction approaches G3-Q3- 

Existing methods can be divided into two main groups. 
Firstly, methods for solving specific, global problems. An 
example of such a problem is majority classification in which 
one only knows the initial condition and the desired outcome. 
Secondly, methods that exploit the entire time series of config¬ 
urations, where it is assumed that all configurations are known. 
Only limited research efforts have been devoted to problems 
involving identification based on partial information |4j. 


The main goal of the research presented in this paper is 
to develop methods capable of automated CA identification in 
case of partial information. The paper is organized as follows. 
In Section [II] we start with introducing basic definitions and 
presenting some well-known facts on CAs. Section III holds 
the formal definition of the CA identification problem, while 
in Section [IV] we reformulate this problem as an optimization 
task. In Section [V] the evolutionary algorithm for solving the 
identification problem is presented. The paper is concluded 
by Section [VI] which presents initial results of computational 
experiments. 


An introduction to the methods presented in this paper, and 
a simpler formulation of the discussed algorithm can be found 

in (T8J. 
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II. Preliminaries 

We start by defining a CA. In this paper we will concentrate 
on ID, deterministic CAs with a symmetric neighborhood. 

Let r £ N and f a : {0, l} 2r+1 -A {0,1} be any 
function, then for N > 0 we define the TV-cell global CA 
rule A n : {0,1}^ -A {0,1}^ as: 

-4iv(* . . , 5^, . . .) = (* • * , f A^i—rt • • • i ^z+r)? • • •)> (1) 

using periodic boundary conditions, i.e. for any i £ Z it holds 
that s i+N = Si. 

The function Ja used in this definition will be referred to 
as a local rule, and the integer r will be referred to as the 
radius of the neighborhood. Any local rule can be uniquely 
defined by a lookup table (LUT) that lists all of the possible 
arguments together with the corresponding function values. It 
is assumed that the arguments are listed in a lexicographic 
order. The general form of such a LUT in the case of radius 
r = 1 is shown in Table U 

TABLE I. LUT OF LOCAL RULE R = (ls,h, le,h,U, h,h,h)2 
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110 
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100 

Oil 

010 

001 

000 

h 

h 

h 

h 

u 

h 

1*2 
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The LUT can be used to enumerate local rules, as the coef¬ 
ficients U can be treated as digits in the binary representation 
of an integer R, i.e. R = Xw=i U 2 Z_1 - Clearly this extends to 
higher values of the radius. Due to the fact that the ordering 
of arguments in the LUT is fixed, only the second row needs 
to be stored, such that a LUT may be represented as a binary 
vector. The length of such a vector is 2 2r+1 . 

With {0,1}* we will denote the set of all binary sequences 
of finite length, i.e. {0,1}* = U 1 v>o{^A} jV - The function 
A: {0,1}* —> {0,1}*, satisfying A(X) = An(X) if and only 
if X £ {0, 1} jV , where each of the global rules A .y is defined 
with the same local rule f a , will be referred to as a generalized 
global rule of a CA. Such functions will be frequently used 
throughout this paper, therefore we will simply refer to them 
as global rules or rules. In this paper a CA will be identified in 
terms of its global rule, and by referring to a CA we therefore 
always refer to its global rule in this generalized sense. 

Note that rule A is uniquely defined by a given local 
rule f a. but the opposite is not true. For a given rule A we 
may find different local rules defining it. Fact [T| highlights 
the relationship between different local rules defining the 
same CA. 













Fact 1. Two local rules f: {0, l} 2r+1 —► {0,1} and 

g: {0, l} 2 “ _,_1 —► {0,1}, u < r, define the same CA if and 
only if it holds: 

f 1, ■ • • 5 ^2 r+l) = g(s r —u+ 1; • • • ; tt r J rU J r \ ), (2) 

for any (si,..., s 2r +i) £ {0,1} 2? ' +1 . 

Example 1. Let g : {0,1} —> {0,1} be defined by g(s) = s 
and /: {0, l} 3 -> {0,1} be defined by /(si,s 2 ,s 3 ) = s 2 s 3 + 
s 2 (1 — s 3 ). We can see that for any s 1; s 3 £ {0,1} it holds 
that /(si,s 2 ,s 3 ) = <?(s 2 ) = s 2 , and thus / and g define the 
same CA rule, which happens to be the identity rule. □ 

For a given neighborhood radius r, A r denotes the set of 
all CAs that can be expressed with the use of a local rule with 
a neighborhood of radius r. CAs belonging to A\ are referred 
to as Elementary CAs (ECAs), and form the most commonly 
studied class of 2-state CAs ID- 

Two important properties of the sets A r are underlined in 
Factd 

Fact 2. For any r > 0, A r C A r +i and \A r \ = 2 2 ~ r+ \ 

Let A be a CA, X £ {0,1} M for some M and T > 0. 
The finite sequence of vectors given by: 

(X,A(X),A 2 (X)),...,A t ~ 1 (X)), 

where A t denotes the /—th application of the mle A, will be 
referred to as the space-time diagram containing T time steps. 
Each of the elements of a space-time diagram will be referred 
to as a configuration of the CA, while the first element will be 
referred to as the initial configuration. If t = 0,1,..., T — 1 
and m = 1,..., M, then A t (X)[m ] refers to the state of the 
m-th cell in the i-th element of the space-time diagram. 
Example 2. We consider an ECA defined by local rule 150. 
The LUT of ECA 150 is shown in Table UB 


TABLED. LUT OF ECA 150 
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0 

0 

1 

0 

1 

1 

0 



Fig. 1. Space-time diagram of ECA 150 

belonging to the set {0,1}. Additionally, let the first row I[l] £ 
{0,1} M . Such an array I will be referred to as an observation. 
If an observation I does not contain the symbol ?, we refer to it 
as spatially complete. The first row /[ 1] is assumed to represent 
the initial configuration of a CA, and row I[n] for n > 1 
represents the configuration at time step r„. It is assumed that 

t~n ^ fn+ 1 * 

Let I be an observation. The number C(I) = ff{I[n , to] 

?} will be referred to as the number of completely observed 
states. In our case, for any observation I it holds that C(I) > 0. 

For each observation I, we define the set com(/) that 
contains all of the spatially complete observations I', satisfying 
I'\n : to] = I[n,m\ for all n,m such that I\n,m] ^ ?. 
Example 3. Let observation I be given by: 
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0 

? 

1 
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1 
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Then the set com(f) is given by: 


Figure [l]depicts a space-time diagram of ECA 150, starting 
from a random initial configuration. Following a common 
convention, the space-time diagram is visualized as a bitmap in 
which every row corresponds to a configuration at specific time 
step. The first row in the image is the initial configuration. State 
one is drawn as a black pixel, while white pixel corresponds 
to state zero. □ 


III. Problem statement 

In this section we define the identification problem. The 
formulation presented below is based on the concept of an 
observation of a space-time diagram, which is assumed to 
be incomplete, i.e. it contains only partial information on the 
states of the CA. 

Formally, we assume that the states of a system, which 
is assumed to be an unknown CA, were observed at certain, 
unknown time steps. Let I be an N x M array containing 
symbols belonging to the set {0,1,?}, where the symbols 0 
and 1 denote valid states, while ? denotes an unknown state 


com(/) = 
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As can be easily counted, C(I ) = 7. 


□ 


We will say that a CA A fits the observation I if and only if 
there exists an I' £ com(J) and a sequence of natural numbers 
(r ra ) such that r n < T n+ 1 and for any n £ {1, 2,..., N — 1} 
it holds: 

A T -(I'[l}) = I'ln+1}. (3) 

Proposition 3. Rule A fits the observation I if and only if there 
exist an I' £ com(/) and a sequence of natural numbers ( t n ) 
such that t n < t n+ 1 and for any n £ {1,2,..., A7 — 1} it 
holds: 

A t -(I'[n]) = I'[n + 1], (4) 











































The sequence (r„) in the definition of fitting, corresponds 
to the time steps in the CA evolution (which are assigned 
to the rows of the observation), while the sequence (t n ) in 
Proposition [3] refers to the number of missing time frames 
between two consecutive rows in the observed diagram. Ob¬ 
viously T n =YJi=lti- 

In practice, it is useful to be able to use more than one 
observation for the identification. Therefore, we will consider 
observation sets X containing a finite number of observations. 
For simplicity, we assume that the elements of X are numbered, 
i.e. X = {Ji,..., J|x|}. We will say that rule A fits the 
observation set X , if it fits all of the observations in the set. 

Note that for the sake of simplicity we will write C(X) to 
express the number of observed states in all of the observations 
belonging to I, i.e. C(X) = Yhi^z C(-0- Additionally, we will 
write M(X) to denote the total number of columns in all of 
the observations belonging to X, i.e. M(X) = ’Yhiaz Mi where 
Mi is the number of columns of observation 7. 

For a non-empty observation set X , the set TZ(X) will 
denote all CA rules that fit the observation set X. The iden¬ 
tification problem is defined as finding the elements of the 
set 1Z(I) based on X. In practice, our goal will be limited to 
finding at least one of the elements of 'R(X) n A r for some 
r > 0. The problem can also be seen from the machine learning 
perspective in which the observation set is a training set, from 
which we try to leant and build a set of rules based on this 
knowledge. 

The following fact will be used in the design of the 
identification algorithm, to simplify calculations. Informally, 
it could be expressed by understanding the observation set X 
as a set of conditions that the rule needs to meet. Having fewer 
conditions, it becomes more likely to find solutions meeting 
those conditions. 

Fact 4 . Let X be an obseri’ation set, and let X' C X. Then 
U{X) C K(X'). 

Since we consider only finite observation sets, we know 
that for every observation set X there exists a T > 0 such 
that, if a solution exists, and it l n ) is the time gap sequence 
of observation I £ I, then 1 < t J rl < T, for every n. In the 
construction of the solution algorithm, we will assume that an 
upper-bound for T is known. 


IV. CA Identification as an optimization problem 


The identification problem, defined in Section III can be 
formulated as an optimization problem, which in turn enables 
the use of evolutionary search methods. 


We start with an auxiliary definition. Let a, b £ {0,1, ?} M 
be some vectors. We define the distance between a and b as: 


dist(a, b) = E \ai~bi\. (5) 

We assume that if there is no i such that a,i ^ 1 and bi ^ 1 
then dist(a, b) = 0. Therefore dist(a, b) = 0 a = b. 

Assume that I is a set of observations of some unknown 
CA belonging to A r , i.e. TZ(X)UA r f 1 0- Let A be a CA, and 


for every I £ I, let (r r () be a strictly increasing sequence of 
natural numbers. 

As a start, we define the error measure Ez(A , (r/)), which 
measures how well a given CA A fits the observation set I , 
assuming that r/ is the time step of the /-th row in observation 
I. The measure Ez is defined as: 

Nj-l 

E x (A,(t!)) = J2 E dist(A T "(/[l]), J[n + 1]), (6) 

/ex n—l 

where Ni is the number of rows of observation I £ X. The 
following fact is an direct consequence of the definition of the 
identification problem. 

Fact 5. A £ 1Z(I) if and only if there exists a sequence (r/) 
such that Ez(A, (r/)) = 0. 


Note that in the case when X = {7} we will write Ei 
instead of E{jy 

Let ( ti ) be a sequence of natural numbers, and let A be a 
CA rule. Observation /jW defined as: 




I[n,m] L 

A^(If u) [n-l])[m], 


if 7[n, to] ^ ?, 
if J[n, to] = ?, 


will be referred to as the A-completion of I with_time gaps 
(ti). Note that any observation 7 satisfies /[l] = 7^ 1] for 

any A, (tf). 


Fact 6. 12, £ com(7). 

Example 4. Assume that CA A is ECA 150 with LUT given by 
Table |h| and local rule / 15 q. Let , = (1,2). We consider 
the observation I defined in Example pi and compute /)] .. 
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The calculation is as follows. Firstly we compute 7^[2,2]. 
Since t-\_ = 1 we simply apply the rule to the first row of 
/, i.e. J£)[2,2] = /i5o(/[l,l],/[l,2],/[l,3]) = 1. Since 
t 2 = 2, to find /)'] , [3,3], we first need to compute one 
additional configuration by evaluating the rule on configuration 
-j[2], It is easy to check that A(I^. 2]) = (0, 0,0), and thus 
7 ^ 13 , 3 ] =/ 150 ( 0 , 0 , 0 ) = 0 . 1 □ 


Based on Proposition [3] we define an alternative error 
measure Ez(A , (t{)) that will turn out to be more useful in 
the construction of the solution algorithm. Assuming that (t{) 
is a sequence of natural numbers representing time gaps, Ez 
is defined as: 


AT/-1 

Ez(A, (tf)) = E E dist (^"(^)W)^W)[^+ 1 ])- 

/ex n= 1 

(7) 

Since 7^ , £ com(7), we can express Ex without using the 
function dist as: 

Nj — l 

Ez(A, (tl)) = E El^( J &W)-%)[ n+ 1 H- (8) 

lex n—l 













Example 5. We refer again to observation I, CA A and (t,) 
used in Example [4] and we compute the error measures Ej and 
Ej. Let us start with Ej. Following the fact that r n = U, 
we get (ly) = (1,3). The error measure Ei can be computed 
easily by evolving A, starting from the initial configuration 
/[l] and comparing the results with the values in I, for entries 
not occupied by ?. 

Starting from the top: A(I[1]) = (1,1,1). Since t± = 1 we 
compare the outcome with the second row of I. As we see, 
I[ 2,1] = 0 ^ 1 has an incorrect value, /[2, 2] =? so it does 
not contribute to the error and /[2, 3] = 1 which is a correct 
value. Since T 2 = 3 we should further evolve A three times, 
starting from A(/[l]), but since A((l,l,l)) = (1,1,1), we 
can simply compare J[3] with (1,1,1) and see that no errors 
occur. Summing up, the total error is: Ej(A , (r,)) = 1. 

Similarly, we find the value of Ej , by taking pairs of rows 
Iy. )[tt,] and I[n+ 1 ] and comparing the results of A tn (I^ ^[n]) 
and I[n+ 1]. The error in the first pair of rows is the same as 
in the case of Ej. For the second pair the initial condition is 
/)) , [ 2 ] = ( 0 , 1 , 1 ), and since A(0, 1 , 1 ) = ( 0 , 0 , 0 ) and since 
A((0,0,0)) = (0,0,0), we do not further evaluate A. We 
compare ( 0 , 0 , 0 ) with /[3], which yields 2 incorrect values. 
Summing up, the total error is Ei(A , ( ti )) = 3. □ 

The relation between Ex and Ex is expressed by the 
following proposition. 

Proposition 7. Let A be a CA rule and X an obsen’ation 
set. There exists a strictly increasing sequence (r/) of natural 
numbers, such that Ex(A , (r/)) = 0 if and only if there exists 
a sequence (t{) of natural numbers such that Ex(A , ( t \)) = 0. 

As a consequence of Proposition [7] the identification prob¬ 
lem can be expressed mathematically as the minimization of 
E. Note that this is only possible due to the assumption that 
observation set X contains partial space-time diagrams of some 
unknown CA. In a more general setting, where the observations 
could have a more complex origin, such a simplification is not 
possible. 

As mentioned earlier, we consider the case where the upper 
bound for the time gaps is known. Using this knowledge, we 
define the error measure Ex independently of the selection of 

(tf) as: 

E X {A)= min E X (A, (ff)). (9) 

l<tf<T 

Note that the minimum in (9j) is always defined, since 
there is a finite number of possibilities for the choice of t I n . 
Additionally, note that for a spatially complete observation /, 
the choice of t : n is independent of the choice of t r m for any 
n 7 ^ to, and for observations I and J, the choice of (tj) is 
independent from the choice of (tf). Consequently, to find the 
value of Ex in the case of a spatially complete observation 
set, we need to examine at most J2ipx T (iVj — 1 ) sequences 
of time gap lengths. 

In the general case, the choices of the values of (if) are 
dependent on each other, and thus in order to find the exact 
value of the error measure we need to examine all of the 


rp Ni ~i possibilities, which holds a substantial computational 
burden. Due to this, even in the case of partial observations, 
we follow the approach described above and treat the time 
steps independently. The only difference that we introduce 
is that if for given n, few different candidate values for if 
lead to the same, minimal value of the pairwise error, one of 
those candidates is being selected randomly. Such an approach, 
is a stochastic overestimation of the error, i.e. the calculated 
value will never be lower than the actual error. Additionally, 
if a given CA is a solution to the problem, recalculating 
the approximate error measure multiple times increases the 
probability of finding the exact value, which is found by taking 
the minimum of all of the obtained results. Such an approach 
turned out to be sufficient in the discussed context. 

V. Evolutionary algorithm 

Having stated the identification problem as an optimization 
problem in this section, we describe its solution using an 
evolutionary algorithm based on the classical GA. In order 
to follow the GA approach, we need to define the individuals’ 
representation, the population structure, a fitness function for 
ranking the individuals, but also the selection procedure for 
reproduction, and finally the cross-over and mutation operators. 
Formally, also halting conditions need to be formulated. 

A. Representation of individuals and population structure 

Here, the individuals that make up the population are CAs, 
encoded through the LUT of their local mles, which is possible 
since the LUT of any CA A £ A, can be represented as a bit¬ 
string of length 2 2r+1 . We assume that the population consists 
of CA belonging to A r , for some r > 0. 

We consider populations of P > 0 individuals. By V 1 we 
denote the population of the ?-th generation of the GA. The 
population V 1 is the initial population, and is constructed by 
randomly selecting P bit-strings. Populations V for i > 1 are 
the outcomes of applying the genetic operators, according to 
the rules described in the remainder of this section. 

B. Fitness function 

The fitness function is directly related to the error measure 
Ex defined by ([bji. Although Proposition [j] states that the 
error measures given by and <[9]> can be used interchange¬ 
ably, preliminary experiments showed that the later results in 
efficient and convergent algorithm, while suboptimal results 
were obtained using the measure given by (|6j. This follows 
from the fact that the error in row n is affected by errors 
appearing in rows 2,..., n— 1. As we know from the research 
on dynamical properties of CAs, small initial perturbations can 
strongly affect the final system state (20). For that reason, it 
is easier to optimize Ej with a GA as compared to Ej. 

Let L e {0,1} 2 + be a LUT of some local rule which 
defines a CA A. Then fitx(L) denotes the fitness of A, and is 
defined as: 

fit x(L) = C(X) - M(X) - E t (A). (10) 

The fitness function takes integer values from 0 up to C{X) — 
M(X), i.e. there are finitely many possible values of the fitness 
function. The goal of the GA is to maximize fitness, and a CA 



with a maximal fitness value is a solution of the identification 
problem. From the above, it is clear that if C( X) — M(1) 
is close to zero, solving the problem is infeasible, since the 
number of possible values is very small and the population 
is not able to gradually increase its fitness. Additionally, if 
C{X) = M(I), then the problem is trivial because every CA 
is a solution. 

The fitness defined by ( p~(T| > has proven to work effectively, 
but the computing time needed for its evolution becomes 
unacceptable if the observation set is large. Therefore, during 
the evolution, to estimate the value of fitx we use fitx' for 
some non-empty subset X' C I. We start by randomly selecting 
elements for the subset X' . Subsequently, but before evolving 
a new population we replace one of the elements in the subset 
X’ with a randomly selected observation from X. Due to Fact [5] 
we are sure that such an approach does not result in reducing 
the solution set. 


C. Selection operator 

Having defined the fitness function, we can define the 
selection operator, which is responsible for selecting the parent 
individuals that will be used to produce the next generation. We 
use a random selection method where the selection probability 
of a given individual is proportional to its fitness. Individuals 
are selected with replacement, i.e. individuals might be se¬ 
lected multiple times for reproduction. 


D. Cross-over operator 

To produce offspring, we select two parents according to 
the procedure described in Subsection V-C A uniform cross¬ 
over operator is used, i.e. if L\, denote parents, the outcome 
of the cross-over operator is a vector L c with values that are 
randomly selected from L\ and L 2 , i.e. P(L c [i] = Li[i\) = 
P(L c [i\ = L 2 [i}) = 0.5. 


E. Mutation operator 

Finally, the offspring individual is mutated. A simple bit- 
flip mutation is being used, i.e. for every position of the 
vector a decision is made whether or not the value should 
be flipped, with pf being the probability of flipping the value. 
The expected number of flipped positions in the population is 
Pp f 2 2r+1 . 

F. Elite survival 

After evolving a new population, the elite survival pro¬ 
cedure is applied. Our experiments proved that such an 
approach is required to reach convergence. The procedure 
is implemented by a deterministic selection of Pe P 
fittest individuals from the previous population used to replace 
randomly selected individuals in the newly evolved one. 

Including this elite survival process can dramatically in¬ 
crease the performance of the algorithm, though there are cases 
where such an approach causes the population to progress 
towards a local optimum. To overcome this, we apply a simple, 
adaptive procedure that deactivates elite survival in cases 
when the maximum fitness value of the population remained 
constant for more than N 0 g- generations. The elite survival 
procedure is again switched on after a predefined number of 
N on generations, or if the maximum fitness improved. 


G. Halting conditions 

The algorithm evolves by generating populations according 
to the procedure described above until a maximum, predefined 
number of populations A was evolved or, if a CA that fits the 
observation set was discovered. 


As mentioned in Subsection V-B| during the evolution, the 
fitness fitx is approximated by fitx' for some X' C X , which 
is effective for selection, but can not be used in the halting 
condition since fitx'(A) = C{X') — M(I') does not imply 
fitx(A) = C(I) — M(X). Therefore, for the individual A with 
the highest value fitx'(A), we additionally calculate fitx(A) 
and base the halting condition on it, i.e. the algorithm stops as 
soon an element is found. 


VI. Results of experiments 

By means of our experiments we verified to what extent 
the partiality of observations affects the efficiency of the GA 
in terms of the number of GA iterations required to find a 
solution. 


We concentrated on two ECAs: 150 and 180, with LUTs 
given in Table [D] and III respectively. 


TABLE III. LUT OF EC A 180 
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In this experiment, the GA evolution is based on obser¬ 
vation sets Xa{1A) for k = {0,1,..., 150} and ECA A G 
{150,180}. The integer k will be referred to as the problem 
number. The observation set Xa{ 0) is a set of Cl > 0 obser¬ 
vations obtained from V. different, random initial conditions 
common for both A, by selecting subsequent configurations 
of ECA A generating time gaps of random length from 1 
to T. The set 2A(fc) for k > 0 is built from observations 
belonging to XA(k — 1) by modifying them in such a way 
that 7 r = 2000 randomly selected, completely observed entries 
are replaced by “?”. In other words, by increasing k the 
effect of spatial partiality is increased. As a result of such a 
procedure we obtained a series of observation sets (/yi(fc)) fe _ 0 , 
for which it holds C(lA(k)) — C(lA(k + 1)) = 7r. The 
identification algorithm was then executed for each of the 
obtained observation sets. 

Given that the family of ECAs contains only 256 members, 
the identification problem would be relatively easy to tackle, 
so we set the radius r = 2, i.e. the population contains local 
rules with radius r = 2 represented as bit-strings of length 
32. Without this modification the algorithm is able to find a 
solution in a few iterations, by examining the entire search 
space. 

In order to account for the stochastic nature of the GA, 
the experiment is repeated L > 0 times for each r, k. The 
values of of the GA parameters used in our experiment setup 
are shown in Table [IV] 

The results vary significantly depending on the rule in 
question, which is not surprising since the dynamics of ECAs 
150 and 180 is different. The normalized Maximum Lyapunov 
Exponent (nMLE) (2Tj-{23) of the former is the highest among 
















TABLE IV. Parameters used in the experiment 


param 

value 

description 

r 

2 

rule radius 

pf 

0.01 

probability of flipping 1-bit in mutation 

P 

512 

number of individuals in population 

Pe 

32 

elite size 

T 

10 

bound for the time gap length 

c 

69 

number of rows / columns in each observation 

a 

64 

number of observations 

7T 

2000 

number of cells being removed from each observation set 

S 

8 

number of samples for fitness approximation 

A 

5000 

maximal number of the GA populations 

L 

20 

number of repetitions of the GA per rule 



Fig. 2. Space-time diagram of ECA 180 

all of the EC As, and thereby this CA’s behavior may be 
considered complex. In contrast, the nMLE of ECA 180 is 
only approximately 0.48, which hints that, in some sense, the 
behavior of this ECA is simpler than the one displayed by ECA 
150. The differences in the overall dynamical complexity of 
these two CAs can be acknowledged by examining their space- 
time diagrams, which are depicted in Fig. [I] and Fig. [2] 

To understand the performance of the GA, we first checked 
for which k the algorithm was able to find a solution (Fig. 
|3j. When comparing the plot for ECA 150 with the one for 
ECA 180, it is clear that the identification problem turned out 
to be much more challenging for ECA 150. Indeed, for this 
ECA, the algorithm was effective only if the number removed 
observation elements was smaller than 50 7r, whereas it mostly 
failed when more spatial partiality was added. Besides, even 
for k close to 0, not all of the GA executions were successful. 
In contrast, identifying ECA 180 was always possible for k < 
120 , but for k > 120 we see a sudden drop in the performance. 
Note that in both cases, for k = 150 a solution was easily 
found, since for this setting the problem is trivial, i.e. almost 
all CAs can be considered a solution. 

The above results suggest that, depending on the dynamical 
characteristics of the CA in question, the maximum allowable 
number of missing elements in the observations differs. Further 
research is undertaken to better understand the link between 
the identifiability and dynamics of CAs. 

Figure [4] depict the minimum, average and maximum 
number of GA iterations among the runs resulting in a solution 
for ECA 150 and ECA 180, respectively. In the case of ECA 



problem number k 
(a) ECA 150 



problem number k 
(b) ECA 180 

Fig. 3. Number of successful GA executions 

180, we see that the efforts needed for finding a solution grows 
as k increases, up to the point where it becomes impossible. 
Furthermore, we see that in most cases the difference between 
maximal and minimal values is relatively low. In the case of 
ECA 150, the results are much less stable. The differences 
between maximal and minimal values are substantial, and the 
efforts needed to find the solution do not steadily grows with 
the growing spatial partiality. The only similarity between the 
two CAs seems to be in the fact that there exists some critical 
k beyond which the problem becomes impossible to solve. 

Summary 

In this paper we introduced the identification problem of 
CAs in the context of partial observations. An evolutionary 
algorithm for tackling the problem was presented, and its 
performance was verified for the two ECAs. The initial experi¬ 
ments suggest that the difficulty of the identification problem is 
somehow linked to the dynamical complexity of the CAs. The 
problem and solution algorithm presented in this paper, should 
be considered as one of the first steps in identifying CAs from 
data originating from real-world phenomenon observations. 
Unavoidably, such observations will be somehow incomplete 
in the sense that it is impossible to continuously track the 
involved processes. 
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